We present and analyze a momentum-based gradient method for training linear classifiers with an exponentially-tailed loss (e.g., the exponential or logistic loss), which maximizes the classification margin on separable data at a rate of O(1/t^2). This contrasts with a rate of O(1/log t) for standard gradient descent, and O(1/t) for normalized gradient descent. The momentum-based method is derived via the convex dual of the maximum-margin problem, and specifically by applying Nesterov acceleration to this dual, which yields a simple and intuitive method in the primal. This dual view can also be used to derive a stochastic variant, which performs adaptive non-uniform sampling via the dual variables.
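To make the primal picture concrete, here is a minimal sketch in Python/NumPy of Nesterov-style momentum applied to a normalized gradient of the exponential loss on separable toy data. The data generation, step size, and momentum schedule are illustrative assumptions, not the paper's exact algorithm, which is derived from the dual of the maximum-margin problem.

```python
import numpy as np

rng = np.random.default_rng(0)

# Linearly separable toy data with labels in {-1, +1}.
n, d = 200, 2
X = rng.normal(size=(n, d))
w_star = np.array([1.0, 1.0]) / np.sqrt(2.0)
y = np.sign(X @ w_star)
X += 0.5 * y[:, None] * w_star  # shift each class along w_star to guarantee a margin

def normalized_exp_grad(w):
    """Return grad L(w) / L(w) for the exponential loss L(w) = mean(exp(-y_i <w, x_i>)).

    Dividing by L(w) keeps the step meaningful even as the loss decays
    exponentially; the max-shift keeps the exponentials finite.
    """
    m = y * (X @ w)                 # per-example (unnormalized) margins
    c = np.exp(-(m - m.min()))      # stable weights, proportional to exp(-m_i)
    return -(c * y) @ X / c.sum()   # the stabilizing shift cancels in the ratio

# Standard Nesterov momentum loop (step size and schedule are assumptions).
w = np.zeros(d)
w_prev = np.zeros(d)
eta = 0.1
for t in range(1, 2001):
    beta = (t - 1) / (t + 2)        # common Nesterov momentum weight
    lookahead = w + beta * (w - w_prev)
    w_prev, w = w, lookahead - eta * normalized_exp_grad(lookahead)

margin = (y * (X @ w)).min() / np.linalg.norm(w)
print(f"minimum normalized margin: {margin:.4f}")
```

The printed quantity is the minimum margin after normalizing by the weight norm, which is the figure of merit the abstract's rates refer to: under the abstract's analysis it approaches the maximum margin at rate O(1/t^2) for the accelerated method, versus O(1/t) for plain normalized gradient descent.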